The pace of evolution across fitness valleys 
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How fast does a population evolve from one fitness peak to another? We study the dynamics of 
evolving, asexually reproducing populations in which a certain number of mutations jointly confer 
a fitness advantage. We consider the time until a population has evolved from one fitness peak to 
another one with a higher fitness. The order of mutations can either be fixed or random. If the 
order of mutations is fixed, then the population follows a metaphorical ridge, a single path. If the 
order of mutations is arbitrary, then there are many ways to evolve to the higher fitness state. We 
address the time required for fixation in such scenarios and study how it is affected by the order of 
mutations, the population size, the fitness values and the mutation rate. 



I. INTRODUCTION 

Evolutionary dynamics is based on natural selection, 
mutation and genetic drift [1[ . It can be illustrated as the 
dynamics of a population in an abstract, typically high- 
dimensional fitness landscape. Since individuals with 
higher fitness produce more offspring, the average den- 
sity of individuals is highest close to the fitness maxima. 
Many such features as the stationary population den- 
sity in the fitness landscape or the mutation rate under 
which a population can still be concentrated around a fit- 
ness maximum have been addressed ^I^W . An important 
question is how a population evolves towards a fitness 
peak via several intermediate states. If the intermedi- 
ate states have the same fitness as the initial state, then 
evolution to higher fitness states is neutral at first and 
thus poses no significant problems [S]. If the intermedi- 
ate states have lower fitness than the initial state, then a 
fitness valley has to be overcome and it is more difficult 
to reach the fitness peak [Hl^. In this case, population 
stuck on a local peak cannot escape by natural selection 
alone, because there is no evolutionary trajectory with 
successively advantageous mutations. Instead, neutral 
genetic drift becomes important. 

Here, we consider the dynamics of these systems from 
a different perspective. We address the average time a 
population needs to transfer from one peak to another 
one. For small mutation rates and finite populations, we 
calculate this average time analytically. When mutation 
rates are high, we can describe the system by a set of 
differential equations and obtain the relevant times from 
a numerical integration of the differential equations. In 
this framework, the relevant question is how fast a pop- 
ulation evolves [Hj. 

In particular, we can address the question whether a 
population evolves faster from one peak to another via d 
mutations if 

(i) mutations have to occur in a certain order, i.e. only 
a single evolutionary trajectory is available or 



(ii) the order of the mutations does not matter, i.e. 
there are d\ evolutionary paths. 

In the simplest case the intermediate fitness values are 
identical in both the cases and equal to that of the initial 
state. Thus the only difference remaining is the number 
of available paths. When the order of mutations is not 
fixed then multiple paths are available and the evolution- 
ary dynamics will be faster when compared to a single 
path. We can then ask the question: Does a population 
evolve faster on a narrow ridge or a broad valley? This 
implies that we move away from the simplest case men- 
tioned above and decrease the fitness in the intermediate 
states of the multiple paths compared to the fitness in 
the intermediate states of the single path. We show how 
the pace of evolution depends on the depth of the val- 
ley, the number of intermediate states and the size of the 
population. 

In general, evolutionary dynamics depends crucially on 
the size of the population. In a small population a single 
mutation will typically reach fixation or extinction be- 
fore another mutation can arise. The population moves 
as a whole step by step on the fitness landscape. For 
large populations, even for small mutation rates usually 
multiple types arise at the same time. This results in a 
non-zero population density in many states at the same 
time. For intermediate mutation rates, the population 
can either move stepwise across the fitness landscape or 
move several stpdf without getting concentrated in one 
of the intermediate states. This phenomenon has been 
termed stochastic tunneling lOj. If the mutation rates 
are too small, tunneling does not occur because it is un- 
likely that a second mutation arises before the first one 
has reached fixation or has gone extinct. If the mutation 
rates are high, tunneling occurs trivially, because the sys- 
tem can be approximated by differential equations for the 
densities in the different states. These different scenarios 
including the limiting cases of stepwise evolution (typi- 
cal for small populations) and continuous evolution (typ- 
ical for large populations) can also be observed when the 
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population size is kept constant, but the mutation rates 
are increased. For computer simulations increasing the 
mutation rate is more convenient than simulating huge 
populations for moderate mutation rates. 

One important example for an evolutionary process in 
which the timescales are of crucial importance is the so- 
matic evolution of cancer [llj. Cancer progression has 
been investigated mathematically since the 1950s [TH- 
[H] , Of special interest are the tumor suppressor genes 
[inillS]. In a normal cell, there are two alleles of the tu- 
mor suppressor gene. The mutation in the first allele is 
neutral if the second wild-type allele can sufhciently per- 
form the function. Inactivation of both the alleles confers 
a selective advantage to the cell and can lead to cancer 
progression. This is an example in which the order of 
mutations does not matter. Many cancers also require 
certain particular mutations that initiate cancer growth 
and pave the way for the accumulation of further muta- 
tions [17]. Recently, it has been shown that after cancer 
initiation, a large number of different mutations may be 
involved in cancer progression [TSHU]. So far, it is un- 
clear if the mutations have to occur in a specific order or 
if there is more variation in the order [531 US] • 

For simplicity, we consider only very simple fitness 
landscapes here in which the fitness in all the intermedi- 
ate states is identical. In natural systems, these fitness 
values will differ and also the mutation rate may not be 
constant. In addition, sometimes the order of mutations 
will matter and sometimes, it will not. Thus, sometimes 
a particular mutation will be a prerequisite to obtain a 
new function, but sometimes new mutations do not re- 
quire any prerequisites. For example, this is the case in 
the evolution of resistance to /3 lactam antibiotics studied 
by [24] and [7]. However, here we focus on a very simple 
model to highlight the general aspects of the dynamics 
by analytical and numerical considerations. 

This paper is organized as follows. We begin with the 
description of the two ways to order the mutations, the 
single path and the hypercube. We then derive analytical 
approximations of the fixation times for small mutation 
rates and discuss the effect of the different parameters 
on the fixation times. Next, we address the dynamics 
for intermediate and high mutation rates. Finally, we 
explore biological examples which can be modeled using 
this approach. 



II. MODEL 

To model evolutionary dynamics in a haploid popu- 
lation of size N, we use the Moran process [T] (SFj. In 
each time step, one individual is selected at random, 
but proportional to fitness. It produces one offspring, 
which replaces a randomly chosen individual. In one 
generation, each individual reproduces on average once. 
During reproduction, mutations occur with probability 
jj,. We are interested in the time it takes until d mu- 
tations reach fixation in the population, starting from a 



homogeneous population in the initial state without any 
mutants. Moreover, we aim to explore the dynamical 
features of this process. We restrict ourselves to two dif- 
ferent cases that allow the derivation of some analytical 
results. 



A. Single path 

If the mutations can occur only in a particular order, 
we have a single evolutionary path, see Fig. 1 for an illus- 
tration. Individuals in the initial state have fitness tq = 1 
and individuals in the final state have fitness > 1. It 
is instructive to characterize an individual by a string of 
d sites, which can either be wild-type or mutated. If the 
order of mutations is fixed, then a particular mutation 
requires another particular mutation as a prerequisite. 
For simplicity, we assume that all the d~ 1 intermediate 
states have the same fitness rj = s < Vd {j — 1, - ■ ■ , d—1). 
For s < 1, the joint effect of the set of mutations make up 
for the loss of fitness caused by the individually deleteri- 
ous mutations. This can be considered as a very special 
case of epistasis [26] . 



B. Hypercube 

If the order of mutations does not matter, evolutionary 
dynamics takes place on a hypercube in d dimensions cf. 
Fig. 1. Thus, there are 2^^ different types of individuals. 
In the initial state, we have d possible mutations. In the 
next step, d — 1 mutations are available. Consequently, 
we have dl possible paths to fixation. Again, we assume 
To = 1 and rd > 1. Further, all individuals with some, 
but not all mutations have fitness s < r^. 




FIG. 1: The order of mutations determines the geometry 
for evolutionary dynamics, shown here for d = 3 sites (e.g. 
genes, nucleotide sites etc.). If mutations can only occur in a 
particular order, only a single path is available (left). If the 
order of mutations is arbitrary, evolutionary dynamics occurs 
on a hypercube (right). The initial states have fitness 1 and 
the final states fitness r ^ 1. All intermediate states are 
assumed to have the same fitness s < r. States are labeled by 
bit-strings, is an wild-type site and 1 is a mutated site. 
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If the mutation probability is small, we do not need 
to make specific assumptions on the mutation process. 
But when the mutation probability increases, we can no 
longer be certain that only a single mutation occurs dur- 
ing reproduction. For simplicity, we do not consider the 
possibility of backward mutations. Although back muta- 
tions are often relevant, especially to escape from evolu- 
tionary dead ends [27^ , it is not straightforward to define 
the speed of evolution in a system with backward muta- 
tions. This is due to the fact that for sufficiently high 
mutation rates, fixation in the final state might never 
occur. Other definitions of the end state of the system 
become arbitrary to a certain extend. The probability 
Urn^m+k that the offspring of an individual with m mu- 
tations has m + k mutations {m < m + k < d) is 



d — m 
k 



d—rn—k 



(1) 



This equation is valid for the hypercube, where the order 
of mutations does not matter. Here, C'^'") is the number 
of different types of mutants with k additional mutations, 
jJ' is the probability that mutations occur at k sites and 
(1 — /^)'*^™~'^ is the probability that no mutation occurs 
at the remaining d — m — k sites. For the single path, 
there is only one possibility to arrange the m + k muta- 
tions. Thus, for fc > 0, Um^m+k is identical to Eq. ([T]), 
except that the binomial factor has to be dropped. The 
probability Um->m that no mutation occurs follows from 
normalization, u^a^m — ^~ X]fc=i' ""m-j-m+fc- Our ana- 
lytical calculations for small mutation rates as well as the 
considerations for high mutation rates are independent of 
the precise form of the mutation rates. However, we need 
to specify the form of the mutation probabilities to per- 
form our numerical simulations for intermediate and high 
mutation rates. 



III. SMALL MUTATION RATES 
A. The pace of evolution for small mutation rates 

For small mutation probabilities, double mutations can 
be neglected. Since mutations occur rarely, we can cal- 
culate the average time until d mutations are fixed in the 
population analytically. Let us first address the evolu- 
tionary dynamics when mutants with fitness are al- 
ready present in a resident population of fitness r^,, but 
no new mutations occur. This scenario is relevant when 
mutation rates are sufficiently small. The probability to 
increase the number of mutants from j to j -I- 1 is 



y+ ^ -r^j N~j 



(2) 



Similarly, the number of mutants decreases from j to j~l 
with probability 



^_ ^ r^{N-j) j 

^ TraJ +ru,{N ~ j) N' 



(3) 



The probability that k mutants take over the entire pop- 
ulation is given by [H [28H30] 



1 + rij^i 
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1 + rifci ^ 
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N ■ 



(4) 



If a mutant reaches fixation, the average number of gen- 
erations this process takes is given by [311 [32] 



Tfix 
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For a neutral process with = this reduces to 
Tfix = N—1. For sufficiently large N , this is the maximum 
conditional fixation time of a mutant. Even for disadvan- 
tageous mutants (r^ < r^,) the conditional fixation time 
is smaller than — 1 [32] . Since there are mutations 
per generation, the time between two mutations is 

Thus, for fji <C N~'^ a mutant reaches fixation before the 
next one arises and mutations will not occur when a mu- 
tant is already present. Thus the population evolves by 
a process where the mutations occur one after the other, 
which has been termed periodic selection |33j and theo- 
retically described as the strong-selection weak-mutation 
regime [Ml [55] . 

The total time r until a mutation reaches fixation in 
a population is the sum of the waiting time until a suc- 
cessful mutant occurs and the fixation time of the mutant 
T = ■'wait +'''fix- The waiting time is the inverse of the mu- 
tation rate divided by the probability that a particular 
mutant is successful. 



'wait 



1 



1 



(6) 



For — > 0, we have r^ait — ^ oo, but rgx remains approx- 
imately constant. Thus, r r^ait for small mutation 
rates. In principle, we could calculate in the presence 
of mutations. But since our approximation is only valid 
for small mutation rates, this will be a minor correction. 

For ^ ^ A^"^, the population is homogeneous most of 
the time. Only occasionally, a mutant arises and reaches 
fixation or goes to extinction. The total time until d 
mutations are fixed in the population is the sum of the 
waiting times for the successful mutants plus the time of 
the d fixation events. For a single path with initial fitness 
1, intermediate fitness s and final fitness r, we find for 
the total time 

= T„ait (s) + {d- 2)Twait (1) + Twait {r / s) (7) 

+ Tfix (s) + {d- 2)Tfix (1) + Tfix (r/s) . 

For small ^, we have Tfix Twait £^nd hence the total 
time can be approximated by 
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d-2 



1 



NMr/s) 



(8) 
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Consider now a "fitness valley" , in which the intermedi- 
ate states have fitness s < 1, but the final state has fitness 
r > 1. To move from the fitness peak in the initial state 
to the fitness state in the final state, first a disadvanta- 
geous mutation has to be fixed in the population. Since 
01 (s < 1) ^ ^, the waiting time of such an event is 
very long. The waiting time for the neutral mutations, 
Twait (1) = ^ ^-nd the waiting time for a successful mu- 
tation, Twait {t/s) are significantly shorter. Thus, is 
dominated by ^^^^^^^^ for s < 1 and sufficiently high N in 
a fitness valley. Fig. 2 shows a good agreement between 
exact numerical simulations and our analytical approxi- 
mation for small mutation rates Eq. ([s]). 

If the order of mutations is arbitrary, evolutionary dy- 
namics occurs on a hypercube. In this case, the whole 
process will be faster, as we have d\ possible paths instead 
of a single one. Now, the waiting times in the different 
states depend on the number of mutations that are still 
available. For the total time r^, we obtain, 

1 '^^^ 1 r 

k=l * 
+ Tfix (S) + (rf - 2)Tfi^ (1) + Tfi^ (^) 

Note that the time of the fixations alone is identical for 
the hypercube and the single path. Neglecting these fix- 
ation times for small /i ( as Tyj^ ^ T^ait) yields 
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(10) 



1 = d-2 



it is obvious that < , i.e. evolutionary dynamics is 
faster if the order of mutations is arbitrary. For fitness 
valleys with s < 1 and a large population size, is dom- 
inated by ^^jv0i(.!i) • more mutations are available, 
this is always faster than the corresponding equation for 
a single path, see Fig. 2. 



B. Thresholds of the waiting times 

Next, we derive expressions for some interesting 
thresholds of the waiting times in the limit of small muta- 
tion rates. Since evolutionary dynamics is always faster 
if many paths are available, we now compare a fitness 
valley in which many paths are available to a single path 
in which the order of mutations is important, but fitness 
does not decrease in the course of evolution. The basic 
question we address here is, whether it is faster to cross a 
broad valley or a narrow ridge in fitness space. In other 
words, we compare t^{s = 1) to t^(s < 1). Since we 
consider only small mutation rates /j,, we neglect the fix- 
ation times Tfix here, although they will not be identical 
in the two scenarios. For s = 1, the single path is neutral. 
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FIG. 2: Fixation time for a single path (squares) and a hy- 
percube (circles) with small mutation rates <^ N~'^) for 
different intermediate fitness values. Evolutionary dynamics 
is always faster in the hypercube. Solid lines show the analyti- 
cal approximation for small mutation rates, Eqs. ([sjl and ( 10 1. 
Numerical simulations shown by symbols agree well with the 
analytical approximation (population size A*' — 100, mutation 
rate /i = 10~^, d — 5, r = 1.1, simulations averaged over a 
1000 realizations). 



We decrease s in the hypercube until we have identical 
waiting times. This yields an implicit expression for s. 
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From this equation, we can numerically determine s for 
any given N. For large TV, Eq. (Ill simplifies to 
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(12) 



where we used (1 — x/N)~ — > for large N . Thus, the 
quantity A^(l — s) becomes constant for large iV, see Fig. 
3. Thus, we can now say how broad and deep a fitness 
valley has to be to lead to the same cumulative waiting 
time as a single neutral path. 

Next, we address the effect of the intermediate fitness 
s, which has an important influence on the cumulative 
waiting time r. Fig. 2 shows how the waiting time de- 
creases with increasing fitness in the intermediate states 
s. If s comes very close to the fitness in the final state, 
the waiting time increases. This increase is seen both in 
the single path and the hypercube. An increase in the 
intermediate state fitness will not always lead to a reduc- 
tion in waiting times. Instead, the fixation times reach 
a minimum when the fitness growth is constant between 
any two consecutive states [9l|36]. For the hypercube, 
the fastest trajectory will be steeper than on a single 
path: at first, many mutations are available and a big 
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FIG. 3: The figure shows the threshold values for which evo- 
lution on a hypercube with fitness s < 1 in the intermediate 
states proceeds as fast as on a single neutral path with s = 1. 
Full lines show A^(l — s) based on the numerical solution of 
Eq. Above the lines, evolution proceeds faster on the 

hypercube. Below them, the neutral single path is faster. 
For — >■ CO, the lines converge to a constant, see Eq. (12 1. 



The symbols show the results from numerical simulations for 
N = 5 and d = 10 (triangles), N = 20 and d = 5 (circles) 
as well as TV = 80 and d = 2 (squares). Symbols are open 
(red) when the single path is faster and filled (blue) when the 
hypercube is faster {fj, = 10~^, r = 1.1, simulations averaged 
over a 1000 realizations). 



fitness increase is not necessary. Later, fewer mutations 
are available and thus, the fitness should increase faster. 
The precise form of the trajectory will in this case depend 
on the number of mutations d and the population size N. 
We note that a similar reasoning can be applied to con- 
struct a fitness landscape that allows to cross a fitness 
valley fastest: the fastest trajectory has the same form 
regardless if a fitness peak is approached (r > 1) or a fit- 
ness minimum is approached (r < 1). Thus, the fastest 
way to cross a fitness valley is to descend to the minimum 
with exponentially decreasing fitness and to increase from 
the minimum again with exponentially increasing fitness. 

Now, we turn to the effect of the intermediate fitness s 
on the individual waiting times. Eqs. ([8| and (10) both 
consist of three terms each. The first term denotes the 
time required to leave the initial state. The second term 
is the time spent in moving through all the intermediate 
states. This second term is independent of s, because the 
transitions are neutral. The last term denotes the time 
required to reach the ultimate state from the penultimate 
state. For small values of s, the probability to fixate the 
disadvantageous mutation is very small. Thus, the total 
time is dominated by the first term. When s is increased 
to a threshold value si, then the time for leaving the first 
state is identical to the waiting time in the intermediate 
states. For the hypercube, si is given by ^''wait (si) = 



J2kJi (1), which reduces to 



^ d-k' 



(13) 



This equation can be solved numerically for specific val- 
ues of N and d. For the single path, the right hand side of 
this equation has to be replaced by N{d—2). For s > Si, 
the time to cross the intermediate states is larger than 
the waiting time in the first state. On the hypercube, 
we can define a second threshold for which the waiting 
time in the first state is the same as the time required 
to reach the final state from the penultimate state. This 
arises because the effective mutation rate in state is d 
times larger than the effective mutation rate in state d—1. 

The threshold S2 is given by ^Twait (s2) = 7"wait (j;) or 



d 1 



1 - 



(14) 



Again, S2 has to be determined numerically. For a single 
path, the factor d^^ in Eq. ( 14) has to be dropped. Thus, 



the threshold S2 occurs for s > 1 and is simply given by 
32 = Vr. 

The fixation time is also strongly infiuenced by the 
number of mutations d. A larger d increases the length 
of the path and usually also the fixation times. For the 
single path, this increase results only from the increase 
in the time required to cross the intermediate states, be- 
cause the time for leaving the initial state and the time 
to reach the final state from the penultimate state are in- 
dependent of d. The time required to reach the ultimate 
state from the penultimate state is also independent of 
d for the hypercube, but the time required to leave the 
initial state decreases with increasing d. This is because 
as d increases, there are more states available in the first 
error class and thus the effective rate of mutation out of 
the initial state increases. As for the single path, the time 
to cross the intermediate states increases with d in the 
hypercube. For the hypercube, this interplay can lead to 
a non-monotonic dependence of the fixation time on d. 
For example, for N = 100 and s — 0.95, the fixation time 
decreases with d for d < 31, but it increases with d for 
d > 31. In contrast, the fixation time always increases 
monotonically with d for the single path. 

Increasing the fitness of the final state r increases the 
advantage of the final state over the intermediate states. 
This will result in the decrease in the time required for 
the population to make the last move. Increasing r has 
no effect on the time required to cross the intermediate 
states or the time required to move away from the initial 
state. As a result, those two times remain constant even 
as r increases, both in the single path and the hypercube. 
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FIG. 4: The probability of tunneling across the hypercube 
(circles, blue) is larger than in the single path (squares, red) 
due to the higher effective mutation rate. The tunneling 
across the valley denoted by the filled symbols (s = 0.9) is 
always larger than the probability of tunneling across a flat 
fitness landscape denoted by open symbols (s = 1.0). This 
arises from the fact that the stepwise accumulation of muta- 
tions would involve the fixation of a disadvantageous mutation 
in the first step. All symbols show the probabilities that the 
population tunnels at least across one state for the single path 
or at least one error class for the hypercube (A'^ = 100, d — 5, 
r = 1.1, averaged over a 1000 realizations). 



adaptation, but recently it has been shown that it can 
have a positive influence on a rugged fitness landscape 

The states in a single path can be characterized by the 
number of mutations. In the hypercube, the states are 
characterized by the types of mutations that have already 
occurred. Thus, there are many different types that have 
undergone a specific number of mutations. However, all 
types that have already accumulated k mutations can be 
pooled into the error class k. The number of different 
types in the error class k is given by (^) = j^^jzrky - 

In a single path, a population is said to tunnel across 
a state if it passes through a state without ever reach- 
ing fixation in that state. Analogously, in a hypercube a 
population said to tunnel across an error class if it passes 
through that error class without ever reaching fixation in 
it. Within the error classes, tunneling can occur across 
individual states, but also across several states at once. 
This means that the whole population passes only across 
that particular state and not across any other, without 
ever reaching fixation in that particular state. Tunneling 
across an error class can also occur in a second way: the 
population can use all of the available states in the er- 
ror class, but the total number of individuals in the error 
class never reaches N. Thus, the probability of tunneling 
via the individual states is always lower than the proba- 
bility of tunneling across the error classes. Fig. 6 shows 
the relation between the different probabilities of tunnel- 
ing in the hypercube with respect to the rate of mutation 



IV. INTERMEDIATE MUTATION RATES 



The analytical approach is only valid as long as the 
mutation rate is small, /i <C N^"^. For higher mutation 
rates, the population does not have to consist of at most 
two different types at any time. Instead, d mutations can 
be fixed in the population without sequentially fixing one 
after the other. This process has been termed stochastic 
tunneling and is of great importance in the context of 
cancer initiation [TUl [111 HH IS?]- Tunneling across fit- 
ness valleys is more likely than tunneling across a flat 
fitness landscape (see Fig. 4). Even for d = 2, the evolu- 
tionary dynamics is characterized by a doubly stochastic 
process, which makes analytical approaches tedious lUj. 
As discussed above, for fj,N^ <^ 1 the population usually 
contains at most two different types. In this case, the 
probability of stochastic tunneling will be very small. On 
the other hand, for iiN > 1, at least one mutant is pro- 
duced per generation. Thus, the probability of stochastic 
tunneling approaches 1. For N~'^ < fj, < N~^, the mu- 
tations are sometimes fixed sequentially and sometimes 
via stochastic tunneling. Fig. 5 shows how the tunneling 
probability increases from to 1 in this interval. 

For intermediate mutation rates, it is likely that the 
population contains more than two different types. The 
types with beneficial mutations will compete for fixation. 
This process is termed clonal interference [SHI f5Hti41j . 
Clonal interference has been considered to slow down 
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FIG. 5: The probability of tunneling across a neutral hyper- 
cube (circles) is always higher than the probability of tunnel- 
ing across a neutral single path (squares). Here, the proba- 
bility that the system tunnels across at least one state or one 
error class is shown for d — 5 and N = 1000. As expected, 
for Nfi > 1 the probability of tunneling approaches 1. In 
contrast to our conservative estimate that tunneling can be 
neglected only as long as the mutation rate is below , 
even for mutation rates as large as lOOA''"'^ the probability of 
tunneling remains close to zero (s = 1.0, r = 1.1, averaged 
over a 1000 realizations). 
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fi for the special case d = 2. 
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FIG. 6: For d — 2, the probability of tunneling across the in- 
termediate state is shghtly higher in the single path (squares) 
than in the hypercube (open circles), shown for A'' = 100 here. 
This is because the effective mutation rate into the intermedi- 
ate state is twice as big in the hypercube, leading to a higher 
probability of fixation. Filled circles show the probability to 
tunnel across individual states of the hypercube. For fiN > 1, 
the system always tunnels. In the hypercube, both states are 
used for this. As expected, we need fi ^ A''"^ for the tunnel- 
ing probability to vanish (s = 1.0, r — 1.1, averaged over a 
1000 realizations). 



Due to higher effective rates of mutation, the prob- 
ability of tunneling across a hypercube is expected to 
be greater than or equal to the probability of tunneling 
across a single path. However, numerical simulations re- 
veal that for d — 2 the probability of tunneling in a single 
path is higher than in the hypercube. This is a special 
case: for d = 2 in a hypercube, the number of states into 
which the initial state can mutate into is 2. The effective 
rate of mutations is thus twice as much as in the single 
path. The number of states which can be mutated into 
next is one, both in the single path and the hypercube. 
Thus the rate at which the individuals are pushed into 
the first state is higher in hypercube than in the single 
path while the rate of individuals being pushed out is 
the same. Thus there is a higher probability of reaching 
fixation in the first error class in a hypercube (see Fig. 
6). We only observe this effect for d = 2, for d > 2, the 
probability of tunneling is higher in a hypercube than in 
a single path, as expected (see Fig. 4). 



V. HIGH MUTATION RATES 

For fj,N > 1, the stochastic features of the dynamics 
become less important, fn this case, the system can be 
described by a set of d+1 deterministic differential equa- 
tions for the fraction Xk{t) of the population that has 
k mutations [33]. Obviously, we have X]fc=o^fc(^) ~ 



Transitions out of state occur with probability To_> = 
(l-^wo^o)a;o, where = Xo + il-Xo-Xd) s+Xdr is the 
average fitness of the population. This includes all the 
reproductive events except for the one where a type is 
produced. Transitions into state occur with probability 
T^o = ^uo-i.o(l ~ ^q)- Thus, the fraction of individuals 
in the initial state follows the differential equation 



ioit) = ^ 



2^0 



Xq 



uo^o{l - xo) - (1 - '---Uo^o)xo 



(15) 



The probability that an offspring is of type k is given 



by 



Afe — X]j=o ~ljr~^j^'=- "^^^ difference between the 
hypercube and the single path only occurs in the quantity 
Uj^k, which is given above for both cases. The sum in 
Afe is over all individuals with k or less mutations and rj 
is the fitness of individuals with j mutations. This leads 
to the differential equation for the fraction of individuals 
with k mutations, 

Xk{t) = [Afe(l - Xk) - (1 - \k)xk] , (16) 



d. Of course, the special case fc = 
(15). This set of d -I- 1 differential equa- 



where k — 0, 
recovers Eq, 

tions describes how the system moves from state fc = 
to state k — d. In general, only a numerical solution of 
this system of equations is feasible. While this allows 
us to infer details of the dynamics, our main interest is 
the time required for fixation of d number of mutations. 
Thus, we solve the differential equation numerically us- 
ing a standard Runge-Kutta algorithm [33]. To find an 
equivalent to the fixation time in a stochastic simulation, 
we average between fixation {xd = 1) and the time when 
there are on average less than 1 individuals outside the 
final state {xd = 1 — jf)- Thus, the fixation time is the 
time when the solution of the differential equation crosses 

Xd = f — 2lV' 

Fig. 7 shows an overview of the fixation times, cover- 
ing the full range of mutation rates. For small mutation 
rates, we have sequential fixation of mutations and the 
time can be well approximated by Eqs. ^ and ( 10 ). For 
high mutation rates, the numerical solution of Eq. ( 16 ) 
leads to a good approximation for the fixation times. 



VI. DISCUSSION 

We have determined the average time during which a 
population moves from a certain initial state to a final 
state of higher fitness. The initial and the final states 
are separated by a fixed number of mutations d. The 
mutations jointly confer a fitness advantage to the final 
mutant which can be represented by a peak in the fitness 
landscape. If the intermediate mutations need to occur in 
a specific order for the evolution of the final mutant then 
it corresponds to the single path. Otherwise, evolution 
occurs on a hypercube and there are d\ ways of reaching 
the final state. 
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FIG. 7: The fixation times decrease with increasing mutation 
rate. Fixation always occurs faster on the hypercube (circles) 
than in the single path (squares). For small mutation rates, 
mutations fixate sequentially and the fixation time can be well 
approximated by Eqs. ([sjl and (10 1. Here, the fixation times 



decrease as fi~ . For high mutation rates, the system can be 
approximated by a set of deterministic differential equations 
and the simulation results for the fixation times can be ap- 
proximated based on the numerical solution of Eq. (16 1. In 
this case, fixation times decrease in general slower than fi~^ 
with increasing mutation rate (population size A'^ = 1000, 
d = 5,s = l,r = l.l, averages over 1000 realizations). 



We have explored the simplest system in which the 
fitness in all intermediate states is the same. As ex- 
pected, the fixation times on a hypercube are shorter 
than on a single path, due to the presence of multiple 
paths available in a hypercube. This observation leads 
to the question: for which parameters does the hyper- 
cube show shorter fixation times than the single path, 
even with an added disadvantage? The fitness in the in- 
termediate states was then set to lower values than the 
ones in the single path. Up to a certain threshold value 
of the fitness of the intermediate states, the hypercube 
shows shorter fixation times than in the single path. The 
value of the threshold depends on the population size, 
total number of required mutations and the fitness in the 
final state. 

The fixation times for large populations largely depend 
on the fitness function and are qualitatively independent 
of the order of mutations. Let us first focus on a flat land- 
scape: when the intermediate states have a fitness equal 
to the fitness of the initial wild-type, then for small mu- 
tation rates large populations have shorter fixation times 
than small populations. This is because the neutral rate 
of evolution does not depend on the population size. But 
the waiting time for fixation of the last mutation becomes 
shorter with larger population size. For intermediate mu- 
tation rates, tunneling starts earlier in larger populations. 
This leads to a marked decrease in the fixation time with 
larger population size. For high mutation rates, the time 
to fixation is no longer dominated by the time for the first 



mutant to reach the final state, but by the time until all 
individuals are in that state. Due to this, for high mu- 
tation rates the time required for fixation can be shorter 
in smaller population as compared to larger populations. 
Next, we focus on fitness valley: If the fitness landscape 
consists of a valley with reduced fitness of the intermedi- 
ate states, small populations have an advantage for small 
mutation rates, as they can easily leave the initial state 
and enter the valley. But for high mutation rates, large 
populations reach fixation faster, because they can ex- 
plore states within and beyond the fitness valley more 
easily. 

Our numerical simulations reveal that tunneling can 
be neglected even when the mutation rate exceeds N~'^, 
at least by one order of magnitude. Thus, Eqs. ([8| and 
( 10 1 provide good estimates for the fixation times even 
in relatively large populations. Concrete values for fix- 
ation times are collected in Table 1. They reveal that 
even in long-term studies of experimental evolution, it 
is difficult to observe the consecutive fixation of neutral 
mutants |45) . Consecutive fixation of advantageous mu- 
tants, however, is significantly faster. For example, while 
Table 1 reveals a fixation time of ^ 10^"'^ generations on 
a single path for d = 10, s = 1 and N = 10^, an optimal 
choice of the intermediate fitness values [9] would lead to 
a fixation time of ^ 10*" generations. 

d^3 d = 10 

Single Path Hypercube Single Path Hypercube 



10^ 
10* 



2.10999 
2.0011 



W 2.00001 



0.943325 
0.834433 
0.833344 



9.10999 
9.0011 
9.00001 



2.03896 
1.93007 
1.92898 



TABLE L The time required for fixation of d mutations in 
units of 10^" generatio ns f or a mutation rate oi fi = 10^^" 
based on Eqs. (|8| and (10 1. The intermediate mutations are 
neutral, s = 1. For small mutation rates, the fixation times 
scale linearly with /i^^. For A'' — > oo, the fixation time on the 
single path approaches fJ-^^{d — 1) and the fixation time on 



the hypercube approaches ^k=o('^~ k)~^. However, the 
mutation rates have to decrease with increasing N to make 
the approximation for the fixation times valid, (initial fitness 
1.0 and final fitness r — 1.1). 



While we have focused on the simplest possible sys- 
tem which allows analytical approximations, experimen- 
tal studies reveal of course a much higher complexity. 
[7] studied experimentally the point mutations in the 
/3-lactamase gene of bacteria. /? lactam antibiotics are 
commonly used, but the bacteria can develop resistance 
to the drugs. Five point mutations in a particular allele 
of the /3-lactamase gene increases the resistance of the 
bacteria to cefotaxime by a factor of ~ 100,000. Theo- 
retically the mutations leading from the wild-type allele 
to the resistant allele can occur in 5! ~ 120 ways. These 
can be represented by a hypercube of d = 5. But in only 
18 of the 120 trajectories, the intermediate mutations are 
either neutral with respect to the initial state or bene- 
ficial. Weinreich and colleagues have shown that these 
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have the highest probabihty of reahzation. For all bene- 
ficial intermediates the fastest way to reach the final state 
would be when the relative fitness increase between any 
two consecutive mutations is constant f5^, but usually in 
nature several different mutations are available and the 
population first evolves to states that provide the highest 
selective advantage. 

In another experimental study the sequence space of 
the 5s rRNA of a marine bacterium, Vibrio proteolyticus 
was explored |46| . The sequences from Vibrio proteolyti- 
cus and Vibrio alginolyticus differ in only four positions. 
All the possible intermediates were constructed by the 
authors and the fitness of each was calculated [17]. Two 
of the valid intermediates have a fitness lower than the 
initial wild-type. We have shown how such fitness valleys 
can be crossed by exploring the phenomenon of tunneling 
or multiple mutations (for high mutation rates). Thus, 
the population does not need to move in a Wrightian 
fashion (the whole population moving as a whole across 
the valley). 

The theory discussed herein deals with basic evolution- 
ary concepts which are important to the kind of biological 
examples described above. More complex properties of 
the experimental studies like more general cases of epis- 
tasis and compensatory mutations can easily be incorpo- 
rated, but there is a huge number of possibilities. Even 



if we are only interested in the ordering of fitness values, 
we can have up to 2*^! distinct epistatic patterns. Thus, 
one should rather focus on concrete systems instead. For 
example, one could simulate the dynamics in a system 
with experimentally derived fitness values and mutation 
rates. Not all the paths of a hypercube might be acces- 
sible for selection, but still some of them might prove 
to be significant depending upon the particular values 
of the parameters, such as fitness values and population 
size. Our goal here was to characterize the simplest fea- 
tures of the dynamics of a population crossing a fitness 
valley. This approach can be helpful when more realistic 
scenarios are addressed. 
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